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ABSTRACT 

A celatively siiple procedure for acdeling periodic 
components in t^ie aeries ^ata presented in this paper, along witfe 
an ejcaaple of the procedure's use with coaiup ication data.- Similar to 
■ultiple regression analysis, the described procedure has four steps 
that are based on ' infer mation about periodic waves and their 
components, how to create models cf wave components by using least 
.s<iuare« estimatiotos of amplitmjd* and phase, the estimation of 
frequency, an algorithm for computing estimates in multiple component 
models, significance testing with the«e estinTates, and ijarmonic 
analysis. The paper reports on the use. of the procedure, on data ' 
concerning newjs stories about the Concorde supersonic transport ijn 
the Washington, D. c. , mass mediA, noting that tte procedure's 
adjustments to hypotheses about intervals between Ccncorde stories 
helped predict cycles in the dat.a that accounted for large amounts of 
variance. The paper concludes that as communication data becomes 
increasingly time based, procedures ^uch as the one 'described should, 
become more prominent. (PL) 
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• A.' , ' ' ' • 

. •• Vg'RlOOiC .COHf^NKNTS IN COMMONICATION DATA: 

MODELS AND HYPOTHSSIS TESTING 

■ . INTRODU.CTI,ON • * 

this p»p»r< wfll oono«rn itatlrf with soac soluUons to a 

\ • ■ 

(foim^on pr^lM which «ri«ta in handling' tia« aifflds datti the 
OKJtlellng and tasting of.' (lypotheaes oonoarning ^ariodlo rises 
and falls in tht laval «f a single variable aeasufTed over time. 
With ooauiuniQation rtaearohers inoreaaingly using, over-time 
^jcU^a to repreis^nt aynaaio. ooawuijifati^n processes (e.g.. JCrull 
and Paulson, 1977; .Arundale, 1977), discussions of alternative 
techniques for xiesorlbing tirxi testing hypotheses oonoernlng " 
oycilo repetition in data are n€(%ded. 

^ : / - ■ * 

5unioation c}ata is particularly likely to exhibit a 
nature. Mass media coverage of a contr^oversial 
rten appears for 'a while , ta^er^ff, then again 
reappears, in a cyclic pattern. Audience preference for ^ 
particular television content is />ften cyclic; .as the periodic 
reappearance and diiaj^pearanoe of Westerns, coaedies, etc., 
testifies. The total amount of news a newspaper carries is 
cyclic over a week, because of factors like slow news <Uys over' 
the weekend, trhe large news hole in T;iursdj|y ^peri oreaied by^ 
sup#M|iirket advertising, and other* environaental variables. 

. There are two general reasons • to or^eate a model of , a7 
oyolioally ohaniin^ time ser.iea Qf oommunioation data; first, 
the oyblio ohangH might be theoretically important. An a- 



f 
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priori hypothetis predicting the frequency And/or the arapJitude 

0^ 0 hinge over time could be set up and ■ atetistioal ly tested; 

Thia i« the analog of predicting a linear relationship between 

. two aeta^ of static data, and testing the significance of a 

fitted regreaaion line. . 

.The second rtaaon to create a periodic ^ model is to 

describe, in sfaplified terms, an^exterr.sive data stft. Thiols 

parallel to using linear regression* to a, iiaaliriflCi describe 
■ r . '\ \ 

the relationship between two ■«tati,o variables. 

V . ' ■ . * ' 

* There are two general tecKniques by which models of^irae 
serie^can be created. The time doq^^n tJiohniques, .based on 
linear regreision concept* and methods for the moat part (cf. 
Box and Jenkins, 1970; Ostrom, 1978),, yield linpar models in 
which varii^6le values at one point in timrf predict subsequent 
values. Time domain modeling is the more, prevalent technique 
in cooaunioatiot. research, but it is somewhat- more difficult to 
use in cer^tai^!, situations than the alteri^atiye .method, 
fracy i f^yt QY K^OfflAi^^^odeH ng . 

■ ' Frw«noy doaain models fit periojjic sinusoid (sine arid 
ooaint/ funotioni to the observed data Ip) much the same wf^y 
lirifar r^greasioh fit» straight line funotibns to -data.. 
Multiple f^inotiont .can be addltively *fitted to prpduoe ooaplex ^ 
vtrittlont ^ver ti«e. In oontraat to tine doaain ■6ciel•^ which- 
product prediotiofi equation! which art a furiOtion of prior 
vtriablt viluti> ^rtqutno'y do««in aodtlt give functional 
rtla^ionthipi bttwttn tht variablt and a fixed frtr^uenoy 



oo«pontrjt. 

\ 



PERIODIC WAVES 

Any p«U«hp of v«lu«i of " a vtritbl* -which r^ptttt 
rtfultrly ovtr ti«# is \ ptriodlo wavr^ A aimple examplt is 
. tht ooflnt* wtv« ^ 

y « .f(t) . R ooi(wt ♦ p) 
R ia th« a»xlauffl tapUtudt of the w«vc, w is the frequency in 
r»tJUn« pej:- unit tiae, •nd p is the phase angle with respect 
to the origin/ ^Ms phase kngle is also exprefsec^ in radians. 
If p ia «ero,» the aiaximuin ,va/ue of y will fall at t«e; 
otherwise, tKe time origii^wiU intersVfct the periodic curve It 
a<jb« other point. The iiACiii. of tMs wav'e, or the time it 
takes to complete a singly complete cycle, is '2 pi / w. 

It can be shown that any ^set of dpta can be represented by 
a series .(possibly infi^^ite) of sinusoid function's Jiimilar to 
the above (Kreysiig, 1972'). This remarkably general 
statejient, which applies not only t€>' data whi-ch is clearly 
periodic ' over time, bu.t to fuTiOtions (even ones with 

disoontinutJ).es, U-ke step or impulse functions), is generally 
teraed a. Fourier series representation of a function. It is 
•naaed for Jiaeph Fourier (1768-1830) who was inf trumont;|;^.in 
developing' l#n^ ia now generally c^lef^Pouriir anaiysis, i>. , 
the r#pre4itntetio>v of oooplex dynaaic fui^otions as a saries of 
sinusoid ana ooaihej functions. 
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Con«idtr th« •xaraple pariodio wave plotted in figure 1. 
It consists^of 100 time points of a square wave with period 50, 
or two oottplete cyolea. Sinoa ona oyola takas a pariod of 50 
titaa units, ard correspohda to - 2 pi radians, tjie furdaoental 
'fraquancy oY this wave is 2 pi / 50 « .1256iS radians per time 
unit. 

Fourier procedures (diaouaaed later in this paper) show 

that a^wave such as this oan be represented by odd .h'araonios of 

tha fundaaiantal fraqabnoy. Let us remove thaae ooaponents one 

at a time, look at the residuals, and gradually liuild: up a 

«odal of tha aquara wave. Figure 2 shows the original data 

with a sinusoid of the same fundamental fraquenoy fit to the 

data under a least squares restriction. Note that this 
\ 

fufidaaental fit is a reasonably good approximation of the data' 
(the F-vaiue for explained variance is over 450!) by itself. 

Figure 3 . plots the residuals when .the fundamental 
frequency is removed. Figure 4 shows "the third hamonio { w « 
6 pi / 50) fit to the residuals, and Figure 5 ahowa the 
r/aiduals after both the fundamental and .third harmonic are 
removed. The fifth harmonic (w « 10 pi /$0) is fit to theae 
residuala in Figure 6, and the final set of reaiduala plotted 

: > y ' ' ' ■ ' 

in Figure 7.-^.' 

rigurw 8 illustrates the.pcAht to be m«da hart. When only 

trte flrat three periodic components of. the square wave are 

oomoined, we have a vary good approximation of the 'original 



d»t*. Addition of further ooapcsin^nts will i«prov« the fit, 
although ooh will oontribut* proportionally l«8s to th« 
goodness -of fit. 

It is difficult to acWel this dtts with ti«e domain 
prooetiures, because of th« disoontinuitiea in the data, yet 
frequency domain analysis will prpduoe this ai^le model: 

y • 3l^MA [ A oo8(kwt) ♦iB sin(kwt) ] 

In this dat^ set the ooeffioients A and B mu»t be 
^atiaated to give a least squares fit, and in a .realistic 
situation w auat alao bt estimated. The derivation of these 
procedures, due primarily to Bloomfield (1976) and Kreyaxig 
i^9^2) will next be addressed. 

^ ' ■■■■■■ 

CRKATINO PERIODIC MODELS ' 

i 

LtAAL'^flUATAA £«tj.matlQns of Ampliti^g inM PJ&um 
.Consider the time series data 

X « [x ,x ,x , . . . ,x ] 

T 2 3 N 

We oan model any periodic ooaiponent in the data as 
X « mu^> IJ cQsCwt -f -f e 

: t ■ ■■ ■ . . t •« 

■ ■♦ ' 

Where x is any data point in Wit series 

^ '.- . ■ ■ 

mu is the m«an of tht':»«rieB 

^ R Is th« mmxlmua amplitude 

w is th« f»*fquenoy in radians ptf unit time 

p is thtf phase angle in radians 

t is time units- ^ 

e is residual error 
■ t 



fieaidual error terms are assumed to be urcorrtl atkl , and all 

discrete observations of x are taker, at equal time intervals. 

For a series of discrete data points the sum of squares of the 

residual values «u»t -be minioizeS. To facilitate tf^is, the 

equation is rewritten by trigonometric ident^y. as 

X « mu -f A coa(wt) i 8 sin(wt) + e 
t , * - t • 

Where A « R G03(p) 
8 » -R sin(p) 

The function to be minimizeiSl is then 

■ N • 2 

^S(mu,A,B,w)«SIGMA ( (x - mu - 'A oos(wt) > B sin(wt)) ] 
t«1 t 

The procedure fbr minimizlrjg this fur-ction for any w is to set 
the partial derivatives pf SS. with , respect to mu, A and B equal 
to zero and solve for the estimated values of each parameter'. 
Bloorafleld (1976) provides a very readable presentation of this 
derivation. For any fixed w, the least squares eatimatis for 
au, A, and B are approximately given by 

X « 1/N" SIGMA [x ] 

\ t . 

R « 2/N SIGMA [(x - 2)^cos(wt) ] 

t 

B « 2/N SIGMA [(x - 2) sin(wt) ] 

Vt 

The exact solutions for X and B are given in Bloocafield, and 
are oooputafcionally difficult. The above approximations are 
generally accurate* to at least ; four significant tfigits. thin 
is more than suffioieht for most applioationa. The estimate of 



» 



th<|' »«an, 8, * it th« simple «rithai«tio mtan of the aeKiea. In 

th^ai and all further formulae, SIQMA will be aajumed to ooyor 
thf range 1 to N if the range ia unapeoif ied . 

. i 

fiaiiaating m ' ^ ^ 

i ■ 
I K and fi have additional ppropertieal^^ich make them uaeful , 

in ^ eatlaating the w which will best fit the, data. The 
'amplitude R of the fitted periodic component is eatiraated by 

2 2 ' 2 . ' ' 

R(w) s A{w) -f B(w) ' 
r 

The freq^•noy is optimized when the magnitude of the fitted 
comppnent is maximum. '.The squared magnitude is usually 
referred to «» the periodogram; although strictly speaking the 
true periodogram values differ by a scale facto^ of N / (8 pi). 



le factor^ of i 
at frequ4noie! 



The amplitude of ^,he periodogram at frequtnoies ne>ir the 

*^ 

strongest periodic component describ^is an inv*rted-U, as in 
^ Figure 9, which is taken from Bloom;field, It is simple to find 
the value for w • which maximises vtha amplitude , in the 
neighborhood of w by numerical- methods, auoh' as Newton's (of. 
Conte, 1965). One must begin near the actual frequency, 
, however, as »<Sst nuiberioal methods will detect only iocal 
makiaa. Por^exarlple ii" on« atte«|^d to optimize the fr*equency 
at af ttarting point of^' w«.23 Lw Figure 9, Newton's Metho<^ 
woUia prctduce a aaxiaiUDi of lest than 10 near the Vrequer.cy of 
.23 raditj^a, when the actual maximum is about 1 10 near w«,2l75. 
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Kstiattes of A, B, and w orfn be found for modtls made up 
1 of ^fflore tr>ar: one periodic cotn^>on«nt. Tht following ' algorithm 
was uaed to oooput* estimates for multipJe ooaponent models In 
this paperL , ' 

1) Set initial estimate* for A, B and w 
"2). Remove these components by subtraotfng out each, 
periodic function in turn, i.e., remove eaoh subsequent 
periodic cb«ponent from the residuals . J eft after removing the 
'•"prior component. — >^ 

3) Compute the mean of the corrected (residual) series and* 
remove this mean from the residuals. 

4) Rstimate new values of A, B, and^w for each component; 
if the new w differsyrom the previous value by more than some" 
criterion, value/ go to 2 and repeat. 

5) When no w value for any of the components changes by 

» 

more than the crite^on value in a complete cycle, the 
optimixation is complete. ^ 

f 

The ♦irtue of this procedure , is that it treats each 
periodic oo«ponent separately. The importance * of this can be 
s^en t^in»p6oting the periodgram in Figur^ 9. Note the' side, 
lobes in the aapUtude produced; by the single 8tror)g periodic 



component, in. a multiple component model, these side-lobes may 
•mttrfere with dttection of j^rimary peribdio components. 
However^ since each component is removed before oo^idering the 
next, this "Isfk'age'^ is of no oonsequence in the parameter 
•sti»«tion or optimiiation process. 



ijL 

a;gnifinin,at ImUat . ."^ 

Affr . .od,l Of periodic coapon.nt, is fit to ob:M»rvd 
<1«U, it ia .ppropri.t. to if it .xpl.in. W« v.riano. 

th.noh.no* .ion. would provide. Sino. .1) p.r*,et.r. ,.r. 
;Oti«.t.d under l...t aqu.r.. con.tr.int,, it la ...y to, o.rry 
5ut..n .n.lyai, of v.riWe for the specified periodic ,odel . 
The 9uaa of aqu.res .re p.rtitioned .3 

3S « SS > SS ' 

X aodel residual 

reaia t«1 t k« 1 k ' " 

N it the number of time points 

M is tM nuaber of components in the model 

All other v.ri.bies are as befpre \ . 

Fno« this p.rtiioning, the .ppropriate F-test la Jvident: 

» / M .nd 

•odel aodel 

^ '33 / (N-M-t) 

rwldual _r«sidu«l 

'^^J N-M-1 degrees 

^ model residual of rre,edo» 

E.oh oo,p6r,..t in t«, «od.l Introduoe, on. degree of fre.<,o,, 

« tyo oor>.unt, (| .nd 6) ,r, being ,.tl«t,d. . J«.t « th* 

..ti«tlon Of lnt.ro<pt .rd .lope Uko oon.t^t.) in llne.r ' 

r.,r.«lon introduo., . d,«r., of fr..do- for ,,oh y.rUWe fit 

to U. d.t..^ Ih. *rror d.gr.,.; of :|r,„do, follow fro» Wl^ 

By u.ir« .ppr^rut. Rif.t, lt*i, po„iM, to d.k.r*ln. 

if.in,!. og«ipon-,r,t. expl.tt algnlflo.nt worn, of „rl.no,, 

t..t the oy.r.u .lirifio.r.o. of,. .„uipl. oo.p«,.r.t >od.l, or 



9«e if' addition of another single ooopohent explains 



significant additioral variance.. 

To test a single cooponert model, the sums of squares are 
obtalcll^d diredtiy from the parameter estimation process, and an 
F-ratiynth 1 and Ni2 d.f. is computed. Multiple oomponent 
ia,<3d«ls first uridergo the paraaeter ^ optimization process and at 
its completion have- F-ratios with M and N-ff-T d.f. ' computed 
^ from .the sums of squares from all components of the optimized 
model ^ This. F-teat gives a , 9^'gnif icande level for 'variance 
explained by all periodic • components taken together. 

.To test the significance /)f adding an additional " component 
to an existing model , an incremental F-ratio is computed -as 

^ « - ss " ) / nss; / (N-M-Di 

inc with new without ; resid 

component component 

Degrees of freedom are 1, and N-M-1. (See Kelly. 1970 or Nie. 
et al., 1975, for a discussion of this procedure, in the 
contex.t of linear regression.) 

An additional statistic of interest 'oa^ bWcoJQputed from 
the sums of iiquare?-; Thi^ statistic is the equivalerjr of the 
Multiple R-Square. of regri-s^lon or Eta-Square' of analysis of 

t 

variance. It will be called E-Square here. to avoid confusion 
with R-Square whidh represents the squared amplUude of a 
periodic vav«. ' B-Square is interpretal?le as the analog of 
Multiple R-Square ,\ ,i .e . , V* ptrcentage iof total , yarianoe " 



explair.«d by lh« ntted model. It is coaputed by 



2 

8 « SS / SS 
% 



' ■ ' , HARMONIC ANALYSIS 

A crltio*! ,,au«»tion triaea in conaidering the d^scuaafon 
of the previoua 'J^jtion, naaely, ^row doea i-one determine the 
^ber of periodic conpor^enta and their atartlng values? Th.e 
answer depends on w^^^ther one is testing a hypothesis about the 
'presence .of components is attempting- to describe a ^set of 



deta. If the former i* thb case'^ there is no problem; 
presumably the theory being tested supplies the description of 
the co-ponerts. , However, if one is attempting X SiSJ^Sju^ 
description, some meana of detecting strong periodic oooponensts 
is needed. 

The first technique which seems obvious is to plot the 
data^and look for periodicitiea. If there are very strong 
periodioitiea, this is an adequate procedure. However, 'the 
human eye is not a sensitive pattern detector, at'leaat when" 
dealinf with data plots. It is aliaost impossible, for eiample. 
to Visually detect statistically significant linear 
relationahipa with oorrelationa of" lea*, than .50. Yet 
detection of , a periodic component which |^plained 10< of the 
variance in a time series might be considered quite important. 

fouriep analyaia provide* a systematic procedure Tor 
dtttotinf periodic oowponents. A full discussion of Fourier 



•nalyaia ia far beyond the scope of the paper. Only a 
coroaptual outline is included here. The interested reader ^ay 
consult a number (?f intermediate mathematics texts, such as 
Kreyseig (1972) or Wylie (I960) for the proofs of the 
atattmefita made here. 

If a sit of frequencies are chosen so that 

w « 2 pi J / 0 1 J 1 N/2 

J r 

it can be showr. that the siriusoicjs corresponding to ' these 
frequencies are orthogonal, and hence independent. Only 
frequencies corresponding to 0 tg pi radians need be^ 
considered, as higher frequencies are indistinguishable from 
their lower hannonios. Th|8 phenomenon is called ♦'aliasing", 
and it implies that the highest frequency detectable in a time 
series has a period of 2t, i.e. a frequency of w « (2 pi) / (2 
t) a pi / t radians per tiaie unit. 

We can define the sine' and cosine coefficients 
correaponding to^the Fourier frequencies as 

A « 1/N SiQHk [ X ] » 2 
0 t«0 . t * 

' N«1 

A « 2/N SIGMA [x oos(w t) ] 
J t«0 t j 

N-1 

B « 2/N SIOMA [x 8in(w t) ] 
: J t«0 t J 

•,'■■■/ 

, - _ ._. , .. 




^ N-1 t 

A « t/N SIGMA [(A) X ] if' N is even 
N/2 t«0 t 

.« 0 if N i» odd 
If N is everi, th^ Uat d*ta point correapondd to a Fourier 
frequency af pi rtdiina, at wbioh the ooaine tero will be 
•ither poaitive or negative unity. If n Is odd, the last data 
point da«d not oorreapond to a Fourier frequency, and thus 
ther« is V^o cosine coefficient. " 

■"Tfolf^that the cosine and sine coefficient* cprrespond to 
the valuta derived for model fitting in the last section of 
this paper .f They have one Important additional property, 
however: they are orthogonal, and*^ hence can \>9 linearly 
superiapoa^d without any interaction aiaong obaponents. Any 
tiiie aeriea can thus be represented as a series of summations 
of Fourier ooeffioienta ♦ ^' " 

N/2-1 ' t 

X « A SIGMA [A C03(w t) B 8in(w t)) ] ♦ (-1) A 
t 0 >1 J J J J N/2 

The A and^ B coeffioienta can. be coaputed at each Fourier 

frequency by evaluating tha above sums. * The resulting- set of 

ooefficienta- it called the Jiliiuyitft^^ of the ( 

time aariea. Cwio^tuaUy, it describes a series of periodic 

oofliponants which, when added together, produce a continuous , 

function irhioh pacs^s through each point In ttie discrete tiae^ 

MPie».: In practice, the A and B ooefflolenti are ooaputed by 

•ore •rrioiffit aethOd* tj^»n tha evalimtion of sums. The "fast" 

rouriir Transfor^'^ developed by Tukey and others (Tukey, 1967) 



ua«^ 4«th«m*tio«l ld«n\itifs • to simplify the, coaputttion 
•prootsaT, , whil« producing the same results. 

Having established that a time series oan be represented 
ft •eriea\of independent periodic oompcnents, w» are V^in a 

r 

poaition to detect important periodicities in a data set by 
observing the magnitudes of the Fourier transform ooef ficienta 
at the Fourier frequencies. These amplitude^, as before, are 
the square r»oot of the sum of the squared sine and cosine (B 
find A) coefficients. 

Figure 10 shows the plot of the amplitude squared of each 

. ■ . / . ■ 

Fourier frequency for the square wave example, Not'e the very 

strong peak at w « .16. This is close to the actual 

fundamental frequency of . 1 3-. If the periodicty was not 

visually evident, it atill ooiild be detected by choosing the 

strongest Fourier component^ then optimizing the w, as 

disGuaeed in the previous > section . 

It might occur to the reader that fitting* a model of k 
components would aiatoly| involve choosing the k strongest 
Fourier coopofients andlising then 'as starting points for the 
previoualjr outlined 'model fitting procedure. Unfortunately, 
the "leakage" problem evident in Figure 9 still exists. Strong 
periodic ddoiponenta produce sid^-lobes in the Fourier 
perioddgraai which may mask weaker independent components . 

Bloo«field (1976) and others have devoted much effort to 
ouiiirjlnf data ajaoothing prooeduret which will daapen the' 
letkM* tfftct. These are very useful if one is restricted to 

. ■^ • ■ . 



•^br.e-paaa" data ariaiyjla, such aa in proces3ir>g raal-tlifle 
sigri«i«. ,How«v«r» ooawuniottion data is usually not eubjtot to 
this r«»triction, so st»r>->fia« proodduras which isolate a 

•inglt ooapontnt, thisn reaova 'Jits effaots ara possitlis. Since 

■>;:.■■*' 

ona oo«pon4fit at a t^aa is procassed, leakage ia'irrelevtnt . 

♦ . * ■ > 



J 



k PrtOCEDURK FOR DETECTING AND MODELING PERIODICITIl 
The following algorij^hm draws upon JUM^ttonic ar^/yais to 

detect periodicities in data, and npdjl tJarameter>«stimation 
^techniques - to fit these periodioiti^rs, in a ^l/ast squares 

fashion to the /ia^e aeries data. 

1) Coiiyute tht diaorete Fourier trans^o^, and choose as 
a starting point th« frequisney ^whioh pro^oes the iargesl 
amplitude. ^ ^ 

2) Optimize the frequency to produce the greatest 
amplitude in t|ie neighborhood of the sArting frequency,, and 
obtain the A and B eatloates for this /requency . If the F-test 
for explained variance is non-signiMcant , this frequency may 
be the spMrious result of other . jreiflodicitits in the data, or 
there aay- be no significant periodic coaponenta. V 

1,3) Reiaove the ooaponentl ^'oaj the original data, using 
tt)4 A and B eatiautea. RepeA, steps 1-3 untli all desired 
^(^mpoeifnta are removed. If component gives significant V- 
values, one »ay conclude UM there are no periodic components 
which are di«tinfuiahli^ y^o« noite. Conponents which ^give 
«i|nificant explanation variance will be u«ed in the final 
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•M*!. Rtawving ooapcM-itnts ««y b« c^ntinu«c! until further 

.ooapofijint* do not product eigniflount F-valuea for explaintd 

v*rlju)<5^ in the reaiduals, until " the additional explained 

variance la - laaa than a criterion value, or*' until a aaxiaum 

number of oo«ponenta have been extracted. • 
♦ ■ 

.. M) Chooae the frequenoiea which gave ^Individually 
significant F-valuea as .starting estimates in building a aulti- 
component model. The frequencies of 'the optimal fit will not 
in general be i(ientic|al to the frequencies -obtained from step- 
wise harmonic analyai's and single-component model fitting, as 
•the set of frequencies obtained from the above steps will not 
be orthogonal'. The final frequencies and A and B coefficients 
from title aod^l fitting. iprocedure^lJky be. tested for significance 
by a^inal F-»ratlo computed for the entire multi-ooopon4nt| 
model. • , 

, Tht procedure outlined above is similar to' step-wise 
regreaaion. ' The strongest component is isolated firsts its 
paraaetera •atiaated, the variance Explained by it removed, 
then the next strongest component chosen, etc. The same 
warnings usually aaaooiated with step-wise regression also 
apply to this procedure. The prder in which the cooppne^ts are 
oho««n wli; affaot the relative variance explaineh by 
•ubaaqutnt ooaponenti; and any oosponent may have been chosen 
ow ar;jSther as a rtault of ohanoe Variation in the data. ^ 
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Th« pr*6ctdJ^^ outl:ir.«c! above is not the only on^i^poaslble. 
An ilternative to, be explored involves using only Fourier 
■frtquencits in models, rather than estimating optimal 
frequencies. This wou|d oapitalire on their orthogonaJ 
properties and ali'ow on* to speak of the oaraponents separately, 
Just as one speaks of orthogonal beta \*ei9hts in regression. 
However, using only Fourier frequencies Magnifies the 
oomplexity of the resulting models, as a number of Fourier 
components may bp necessary to describe' a periodicity which 
doe5 not occur, at a Fourier frequency. 

EXAMPLE Ot HYPOTHESIS TESTING AND MODEL BUILDING 
,Th* following ,ia an example of tne procedur'es outlined 

rove applied to a real communication data set. The data used 

were gathered as part of a mass m»diajyonitoring project. . The 

details of the project and tbe dat» gathering^ procedures can be 

found elsewhere. (Watt, 1977) and will not be discussed here. 

,The time series plotted in Tigure lla-d represents the 

daily prominence of stories about ■ the Concorde supersonic 

transport carried in Washington, D.C. mass media, Including 

network television. The data shows ho obvious periodicities. 

/ 

» 

It waa proposed that mass media coverage is subject to 
thr«« JMkJor oyoles: 

J 

I 
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1) A story "wear-out" oycle of two dtys period. This 
1» an reauW of the eVent-orientttion of the «edi»/ Any single 

, announcement or event will be proajinent only for a short period 
of time and will result in' short bursts of coverage of high 
frequency. 

2) A "weekend" cycle of seven days duration. Few|r 
newsworthy events occ;;r on the weekend, espeoially when a major 
portion of the news evijnts concern governmental annojTnoements, 
as was the. case in the Concorde trials. There is .also less 
breaking news prii^ted on weekends, due to normal staff days- 
off in the news organizations. 

3) ^An "issue" cycle of" about 30 (jays duration. A 
continuing controversy will exhibit recurrent coverage. For 
the issue to reaain on the public agenda, a^poe coverage once a 
month is hypothesised. i 

It must be noted that these are very stringent hypotheses. 
They are the equivalent of specifying the slope of a regression 
line in linear regression. It is more realistic to specify 
approxiitate frequencies of hypothesized oooponents and then 
optiiixe the function fit withih the' neighborhood of the 
frequencies, siaiiar to the regresal6n procedure of adjusting 
the slope for sAxiauai explained variance. 

Table I oontair^s the results of the aodel fit. The thrie 
QOMponents wert first fit to the dajji independently to test the 
hypothttis that each oo«ponent wt« present in the data at 
greater than ohanoe leveli> Aa the table Indicates, only the 
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30 day cycle *pVjro«oh«d signif ia»nce- Th« optiaUtd frequency 
g*v« A 29 day p.riod. This ooaponent ^av* the greatest B- 
Square value of almost .01, equivalent to a Multiple R of .10, 

The lower portion of Table I shows the full test of the 
three ccynpor.ent hypothesis: As would be expected, the 
explained variance is improved onjy marginally over, that of the 
29 day corapor.ent alone. Tha significance of the F-value is 
less than .10, but greater than .05. 

if the alpha level for 'rejection of the null hypotheaii 
was set at .05, as seems normally to be the case, we wou^d fail 
to reject the null statement that there are no periodicities of 
period 30, 7, and -2 days ir. the 'data. If the ;evel was set at 
.10, we would conclude that the iongor cycle of nearly 30 days 
was prasent, indicating an "iasue-oriented" oQverage pattern. 
In any ca'sei only about \% of the variance is explained by the 
hypotheaiMd coaponent(s) . * ' 

Th^ prooetlurea for detecting and modeling- periodicities 
outlintd in the previous section were . oar^iad out on the 
Washington data sat, in an attempt to orat^a a: descriptive 
model of the data. A discrete' Fourier transform of the 
or'iginal data was first obtainad. 

Tha full trariaform ia shown In/faWa ll for illuatrative 
purpoaaa. The oo«putationai jrooaiJurt aatd to obtain the 
tranafora utiliaaa oomplax variable algebra, |0 the trarsfor* 
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is expressed ir. r«*l and ioaginary oooponents-, rather' tnan sine 
«nd ooaine ooef ficients. The squared aapiitude can be obtained 
froffl these coefficients, however, and it corresponds to that . 
Obtained froo A and B coefficients. Since th^ J^nplitude and . ^ 
frequency values are the important ones in model building, 
there is no need to go into the details of the^ complex . 
algebraic procedure except to state that the JJJi Fourier 
^ transform terta, J, expressed in real and imaginary terras is 
related to A and B as defined previously by 

J » 1/2 (A - iB ) 

J J J 

Where J is the Fourier term 

i is the imaginary operator 

(square root of -1 ) 
A and B are the cosine and sine coefficients 

Note that the strongest component in the Fourier transform 

occurs at w « .5^5^, which corresponds to a period of about 12 

days. ^ The amplitude squared of this" component is .5093, 

corresponding to a maximum amplitude of^about .71. 

This component was entered as the ataVting point for 

frequency optimization. Th%^esult8 of this-^single component 

model fit are shown in the* first entry of Table III. - 

Surprisingly, the explained variance was not sigr.ifloant . 

Apparently the actual periodicity in the data was far enough 

from the Fourier frequency that the optimization prooeas 

oonverged on a sid«-lob« frequency, rather than the iotual 

frtqutncy. Thi» la further indicated by the estinwte<|taaximuo 

.•quartd aaplitudt of the fitted function (the sun of the 
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aquarta of the A and B estimates). This value is only .39, 

QOfiaider^ably below the .51 found in the Fourier transform. 

Thia aide-lobe component was removed, from data, and 

the reiiduala a?ain Fourier transformed. Suspicion that the 

first, frequency was^purious was cqnfirmed when, again w « .515^4 

emerged as the strongest Fourier component^ with a squared 

magnitude of .47. Since the spurious frequency had been 

removed, the frequency estimation procedure thia time converged 

to w « .5032 with a squared amplitude of .57, higher than the 

Fourier amplitude, Thia frequency also explained much more 

than chance variation in ^e data, > as the second entry in Table 

lil shofws (over 2i of the variance, p x .003). The period 

eorrespondifig to this frequency is about 12 1/2 days. ^ 

♦ 

This component was removed f rom ' the data (which had 
previously had the spurious component removed), and- the whol^ 
process repeated. Table III summarizes results of the first 
five components. For the purposes of this example, the model 
was arbitrarily • limited to five components. However, 
significant F-ratios for single components could still be 
obtained after 10 cofjponefits were isolated, 

In order to teat the final five component model,- %li the 
frequenoiei obtair*«d above were entered in the frequency 
opt4»4d|k^tiori proceM *aa atarting points. .Tha final model is 
auMuiriW at the bottom of Table III. As it indioatea, the 
mo<i«l fit i» quilt food, wittv. an S-Squmre v*lue of .18, 
equivalent to V*Hultiple R of ov^r .40. If «n additional five 



c'oapon.nts were added tht model, Ir.dioationa are that about 
iO< of the variance in the time aeries could >iave been 
explained with the resulting 10 oocjponent model. 

It should be ntited that the suras of squa^res are computed 
about the mean for the fitted function, rather' than about the 
mean f6r the dat« , so the total ^sums of squares may be 
• different for different models. But the sum^s of squares due to 
fur.ction fl^ varies accordingly, as removal of the constant 
mean constituVesla linear transformation, and thus the F-values 
*re identical. 

The final model consists of the addition of periodic 
■ components of period 12.52, 20.43, 15.21, 9.68, and 683 days. I 
The values predicted by this model are plotted with the ' 
original data in Figure 12a-d. The effect of each component 
can be seen in the plot,, The eight day component , produces 
rapid fluctions which are superimposed on the 12 to 20 day 
conponents. This complex relationship gives several peaks in., 
uje predictive function near peaks in the observed data. The 
683 day oofflponent goes only through half a cycle in the data. 
. It is respor.sible for the rising trend in the" predictive, 
function at either^ end^of the plot. Sinee ^se were periods 
or peik coverage, it accounts for large amounts of ^xiJlained 
variance. 
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CONCLUDING RfclMARKS 
A relatively simple procedure for model ir^ periodic 



components in time aeries data was presented, with an example 



application to communication data. The mathematical orooedures 
outlined have been available for 150 years, and have been in 
use in engineering applications for over 30 years. As 
comaur;ication data increasingly becomes time-based, procedures 
such as- these should become more common, 

A number of topics have not been addressed in thfs paper* 
^ All time series discussed here are* assumed to be equally spaced 
In time, and have Gaussian error terms (uncorrelated error). 
Neither of these assumptions are necessary, with modificajtion 
of the estimation procedures (Bloomfield, 19760. A primary 
topic not touched upon is the use of harmonic models to detect 
relationships betw*»er variables in multivariate time series. 
This is a quite important extension for individuals interested 
in testing hypotheses about -covariation (and possibly 
causation) t^tween two variables measured over time, and as 
such deserves inamediate attention. * 

Finally, it is evident that periodic modeling can only be 
oarritd out wi^h the aid of a oooputeh program, as the data 
•anipulation is extenaive. In *hi8 it ia not unlike wany 
QooMon atatiatioal procedures suoh aa multiple regression, 
factor anal yaia, ato. Tht oooputer program usad to analyse and 
pl6t tha data in thia paper^ is available, v along with a 
doou»«ntation and uaara* surual, fro« the author for a small 
raproductlon faa. * 
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TABLE L , 

Analysis of variancf for hypothesis test of ThREt comfonent model 

WASHINGTON DATA 



Single Component Model Fit3 



Start 


Final 






2 




Sig of 


Frequency 


Frt- quency 


A 


B 


E 




F 






•-.'40982 


-5.4516 ' 


4OO3 


.992 


".320 


.89760 


.896M2 


. 16096 


.2686 


.006 


2.116 


.U7 , 




.21652 


,H1*<13 


.0219 


i009 


^3.167 


.076 



Dcgr€5S of Fret'doh) for"F-rati05 are 1 and 364 



f 

Three Corr.pontnt Kodel 



INITIAL VALUES ARE - 
COhiPONENT FBEqUENCY 
(RADl'^NS)' 

1 3.1*<15901 

2 0.8975971 

3 0.209'»393 

FINAL VALUES ARE - 
COMPONENT FREQUENCY 
(RADIANS) 
1 3.H2501O. 
. 2 0,8965006 
3 0.2165'<23 



COSINE ■» SINE 

COEFFICIENTS 

0.6 0.0 
0.0 0.0 

Q»0 0.0 



COSINE SINE 
COEFFICIENTS 
■0,334536E-fOO 0. 1'^8'4 39E+01 
O.I805O8E4OO 0.2859'41E+00 
0.1^1 19O8E+OO lt),22729O£-01 



FITTED CONSTANT (MEAN) IS 

T.OTAL SUM OF SQUARES IS 

SUM Of SQUARES DUE TO FUNCTION FlTMl 

RESIDUAL SUh OF SQUARES IS 

MEAN SQUARE DUE TO FUNCTION FIT IS 

MEAN SQUARE ERROR IS 

r « 2.U21 WITH 3 AND 362 
SIGNIFICANCE OF F-RATIO IS 0.^98388 



0.69»^9l6&-fOO 
O.36783»4E+04 
O,632766E-f02 
O.361507E+0'< 
0.210922K+t)2 
0.998637E-»-01 
DEO. OF FREEDOM 



E-SQUANE « .01)2 



E « .131 



4 
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TABLE 111 

ANALYSIS OF VARIANCE FOH FIVE COHPONfcNT 

WASHINGTON DATA ^ 



Single Coaponenl hodel Fits 
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Degress of Frccdan for K-r8tios are 1 and 36*^ 
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FITTED CONSTANT (MEAN) IS 
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SUM OF/SgUAFiiS DUE TO FUNCTION FIT IS 

RESIDUAL SUM OF SQUARES IS 

MEAN SQUAHE DUE TO FUNCTION FIT IS ' 
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F » 15.53C6 WITH 5 AND 3OO 
SIGNIFICANCE OF F-RATa6 IS' 0.000012 
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